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Abstract 

In this paper we introduce and study the concept of optimal and surely 
optimal dual martingales in the context of dual valuation of Bermudan 
options, and outline the development of new algorithms in this context. 
We provide a characterization theorem, a theorem which gives conditions 
for a martingale to be surely optimal, and a stability theorem concern- 
ing martingales which are near to be surely optimal in a sense. Guided 
by these results we develop a framework of backward algorithms for con- 
structing such a martingale. In turn this martingale may then be utilized 
for computing an upper bound of the Bermudan product. The method- 
ology is purely dual in the sense that it doesn't require certain input 
approximations to the Snell envelope. 

In an Ito-Levy environment we outline a particular regression based 
backward algorithm which allows for computing dual upper bounds with- 
out nested Monte Carlo simulation. Moreover, as a by-product this al- 
gorithm also provides approximations to the continuation values of the 
product, which in turn determine a stopping policy. Hence, we may ob- 
tain lower bounds at the same time. 

In a first numerical study we demonstrate the backward dual regres- 
sion algorithm in a Wiener environment at well known benchmark ex- 
a mples. It turns out that t he method is at least comparable to the one 
in iBelomestnv et. al.l (|2009l ) regarding accuracy, but regarding computa- 
tional robustness there are even several advantages. 
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1 Introduction 



It is well-known that the evaluation of Berniudan callable derivatives comes 
down to solving an optimal stopping problem. For many callable exotic prod- 
ucts, e.g. interest products, the underlying state space is high-dimensional 
however. As such these products are usually computationally expensive to 
solve with deterministic (PDE) methods and therefore simulation based (Monte 
Carlo) methods are called for. The first developments in this respect concen- 
trated on the construction of a "goo d" exerc i se poh c y. We mention, amon; 



others , regre ssion based methods by ICarrierd (|1996h . Longstaff and Schwart d 
( 2001r). and iTsistsiklis and Van Rovl (|200ll ). the stochastic r nesh method of 
Broadie and GlassermanI ( 2004 ) . andquanti zation algorithms by Ballv arid Paged 
(|2003 !). Especially for very high dimensions. iKolodko and Schoenmakerd (|2004h 
deve loped a policy improvement ap proach which ca n be effectivel y comb ined 
with Longstaff and Schwartz (2001) for example (see Bender et al.l ( 20081 ) and 
Bender et al.l (|2006l )l 



As a common feature, the aforementioned simulation methods provide lower 
biased estimat es for the Be rmud an product under conside ration. As a new 
breakthrough. iRogerd (|2002l ). and iHaugh and KoganI (|2004l ) introduced a dual 
approach, which comes down to minimizing over a set of martingales rather than 
maximizing over a family of stopping times. By its very nature the dual ap- 
proach gives upper biased estimates for the Bermudan product and after its dis- 
covery several numerical algorithms for computing dual u pper bounds have been 
Topo sed. Probably the most popular one is the method of Andersen and Broadid 
2004). although th is metho d requ ires nested Mo nte Carlo simulation (see also 



Kolodko and SchoenmakersI ( 2004 )and Schoenmakers (2005,) ) . In a Wiener en- 



vironment , iBelomestnv et. al. I (l2009l) provides a fast generic metho d for comput- 



ing du al upper bounds which avoids nested simulations. Further iBrown et al 



(2009|) consider dual optimization via enlarging the information were an exercise 
decision may depend on. In this setting they also provide an example were a 
tight dual upper bound can be obtained by non-nested simulation. 

The algorithms for computing dual upper bounds so far have in common 
that they start with some given "good enough" approximation of the Snell 
envelope and then construct the Doob martingale due to this approximation. 
In a recent paper Rogers, (201Q) . points out how to construct a particular 'good' 
martingale via a sequence of martingales which are constant on an even bigger 
time interval. In this construction no input approximation to the Snell envelope 
is used. The me thods proposed in this paper have some flavor of the method 
of Rogerd ( 2010f ). in the sense tha t no approximation to the Snell envelope is 
involved either. In a recent paper iDesai et. al. 1 (I2OIOI) treat the dual problem 
by methods from convex optimization theory. 

The structure of this paper is as follows. Starting with a short resume of well- 
known facts on Bermudan derivatives in Section [21 we analyze in Section [3] the 
almost sure property of the dual representation in detail. There we introduce the 
concept of a surely optimal martingale, which is loosely speaking, a martingale 
that minimizes the dual representation with a particular almost sure property. 
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In this respect we will point out that a martingale which minimizes the dual 
representation is not necessarily surely optimal, and on the other hand, a surely 
optimal martingale is generally not unique. 

In Section 2] we present, as one of the main contributions of this paper, a 
characterization theorem for surely optimal martingales (TheoremlHl) . Moreover, 
we provide another result that guarantees that a martingale is surely optimal if 
it satisfies a certain measurability crite rion (Theorem [TOl). 

In applications of the algorithm of Andersen and Broadid ( 20041 ) one gen- 
erally observes that the lower the variance of the upper bound estimator, i.e. 
the closer the corresponding martingale is to a surely optimal one, the sharper 
is the corresponding dual upper bound. Actually this observation was not well 
studied from a mathematical point of view so far. In Section [5] we study this 
phenomenon and, as a next main contribution, give an explanation of it by 
Theorem [T^ and Corollary [TS] In fact, the latter corollary may be considered a 
stability statement connected to Theorem [TUl 

Guided by the new theoretical insights we develop in Section [B] algorithms 
for constructing dual martingales that are based on minimization of the vari- 
ance (respectively expected conditional variance) of corresponding dual repre- 
sentations and estimators. In this context we present in an Ito-Levy environ- 
ment a regression based backward procedure that constructs a dual martingale 
via minimizing backwardly in time the expected (conditional) variances of the 
dual estimators corresponding to the Snell envelope. We so obtain a martin- 
ga le that allows for computi ng upper bounds without nested Monte Carlo (like 
in iBelomestnv et. al. ( 2009f )l. Moreover we obtain, as a by-product, estima- 
tions of continuation values. Thus, as a result, we end up with a procedure 
that computes upper bounds as well as lower bounds simultaneously via a 
non-nested simulation procedure. The procedure is quite easy to implement 
and may be considered a s a valuable alternative to the non-nested method of 
Belomestnv et. al. where a dual martingale is obtained by constructing 



a discretized Clark-Ocone derivative of some (input) approximation to the Snell 
envelope via regression. In particular, our new procedure o nly requires regres- 



sion a t each exercise date, in contrast to the procedure of IBelomestnv et. al. 



(200i) that requires regression at each time point of a sufficient refinement of 
the exercise grid. 

In Section [71 we present a numerical study of our algorithm. We illustrate 
at two multi-dimensional be nchmark products (one of which is also considered 
in Belomestnv et. al. ( 2009 f)) a backward regression algorithm that, regarding 



accuracy and computational effort, p roduces upper bounds th at show to be at 
least of the same quality as those in Belomestnv et. al. (l2009f), and fast lower 
bounds that are overall better than in Belomestnv et. alT ( 20091 ) moreover. In 
an Appendix, we provide standard results from Statistics which are used for 
several technical arguments in Section 16.11 
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2 Bermudan derivatives and optimal stopping 



Let (Z, : i = 0,1,..., Till be a non-negative stochastic process in discrete time 
on a filtered probability space {ft,J-,P), adapted to a filtration F := {Ti : 
< i < T) which satisfies E\Zi\ < oo, for < z < T. The measure P may be 
considered as a pricing measure and the process Z may be seen as a (discounted) 
cash- flow which an investor may exercise once in the time set {0, T}. Hence, 
she is faced with a Bermudan product. A well-known fact is that a fair price of 
such a derivative is given by the Snell envelope 

Y* = sup E,Zr, < z < T, (1) 

re{i,...,T}, 

at time i = 0. In ((T]), r denotes a stopping time, Ei := Ejr. denotes the con- 
ditional expectation with respect to the u-algebra Ti, and sup (inf) is to be 
understood as essential supremum {essential infimum) if it ranges over an un- 
co untable f amily of random variables. Let us recall some well-known facts (e.g. 
see lNeveiil ^973))- 



1. The Snell envelope Y* of Z is the smallest super-martingale that domi- 
nates Z. 

2. A family of optimal stopping times is given by 

T* = mi{j ■.j>i, Zj > Yn, 0<i<T. 



In particular, 



Y*:=:E,Zr', 0<i<T, 



and the above family is the family of first optimal stopping times if several 
optimal stopping families exist. 

The optimal stopping problem ([l} has a natural interpretation from the 
point of view of the option holder: she seeks for an optimal exercise strategy 
which optimizes her expected payoff. On the other hand, the seller of the option 
rather seeks for the minimal cash amount (smallest supermartingale) he has to 
have at hand in any case the holder of the option exercises. 



3 Duality and surely optimal martingales 

We brie fly recall the dual approach proposed by iRoeerd (l2002l) and, indepen- 
dently, iHaugh and KoganI (|2004 ) . The dual approach is based on the following 
observation: for any martingale (Mj) with AIq = we have 

Y* = sup EoZr < sup Eo {Zr - Mr) < Eq max {Zj - Mj) , (2) 

Te{0,...,T} re{0,...,T} 0<j<T 



^For notational convenience we have chosen for this styhzed time set. The reader may refor- 
mulate all statements and results in this paper for a general discrete time set {To, Ti, . . . , Tj} 
in a trivial way. 
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hence the right-hand sid e provides an upper bound for Yq . iRogersI (|2002l ) and 



Haugh and KoganI (|2004l ) showed that ([2]) holds with equality for the martingale 



part of the Doob decomposition of Y*, i.e. Y* = Y^ + M* - A*, where M* is 
a martingale with Mq = 0, and A* is predictable with = 0. More precisely 
we have 

= E - ^'-i^i*) ' = E (^-1 - ^^-1^*) . (3) 
1=1 1=1 

from which we see A* is non-decreasing due to Y* being a supermartingale. In 
addition, they showed that 

Fn* = max (Z, - M*) a.s. (4) 

The next lemma, by iKolodko and Schoenmaker^ (|2006f) , provides a somewhat 
more general class of supermartingales, which turns relation ^ into an equality 
such that moreover dU holds. 

Lemma 1 Let S be a supermartingale with So =0,. Assume that Zj — Y^ < Sj, 
^ l£ j l£ T . It then holds that 

Yn* — max (Zj — S*,) a.s. (5) 
0<i<T J ^ ' 

For the proof see Kolodko and Schoenmakerk i 200d ). 

Examples 2 Obviously, by taking for S the Doob martingale as constructed in 
Lemma\^ applies. However, the Doob martingale is not the only one. For 
example, in the case Z > a.s. we may also take 

s, = {n;-i)y^, 

where N* is the multiplicative Doob part of the Snell envelope. More precisely, 
Y* = YqN*B* for a martingale N* with Nq = 1 and predictable B* with 
Bq = 1. Hence 

i=i^^-^^i 1=1 ^i-i 

Indeed, since B* is non-increasing due to Y* being a supermartingale, we have 

= ( ^ - 1 ) > (I - 1 ) - - > - < , 

thus. Lemma [7] applies again. 

The multiplicative Doob decomposition in ^ is used bv I Jamshidian ( 2007 ') 
for constructing a multiplica tive dual representation. In a comparative study. 



Chen and GlassermanI (j2007l ) pointed out however, that from a numerical point 



of view additive dual algorithms perform better due to the nice almost sure 
property (j4|). 
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Remark 3 It is not true that for any martingale M which turns into equal- 
ity the almost sure statement holds. As a simple counterexample, consider 
T = 1, Zq = 0, Zi = 2, Mq — 0, and Mi — ±1 each with probability 1/2. Indeed, 
we see that = 2 = £'o(2-Mi) = Eq max(0, 2-Mi), but, i;,* ^ niax(0, 2-Mi) 
a.s. 

In order to have a unified dual representation for the Snell envelope Y* at 
any i, it is convenient to drop the assumption that martingales start at zero. 
We then may restate the dual theorem as 

Y* = inf E, max (Z^ - Mj + M,) (7) 
= max iZ, - M* + M*) a.s., (8) 

for any i, < i < T, where M is the set of all martingales and M* is the Doob 
martingale part of Y* . 

In view of Remark [3] and Examples [2l a martingale for which the infimum 
([7]) is attained must not necessarily satisfy an almost sure property such as ([8]) , 
and, martingales which do satisfy such almost sure property are generally not 
unique. We hence propose the following concept of surely optimal martingales. 

Definition 4 We say that a martingale M is surely optimal for the Snell 
envelope Y* at a time i^ < i < T, if it holds 

Y* = max (Z, - Af,- + M,) a.s. (9) 

i<j<T ^ J J ' ^ ' 

Remark 5 Obviously, the Doob martingale of Y* is surely optimal at each 
h < i < T, and any martingale M is trivially surely optimal at i = T. 
However, it is not true that sure optimality for some i with i < T implies sure 
optimality at i + 1. As a counterexample let us consider T — 2, and Zq = 4, 
Zi ^ 0, Z2 — 2. Take as martingale Afg = 0, Mi = ±1, each with probability 
1/2, and M2 = Mi ± 1, each with probability 1/2 conditional on Mi. Then 
maxo<j<2 {Zj — Mj + Mq) = 4 a.s. Since we have trivially Yq — 4, M is surely 
optimal ati^O. But, maxi<j<2 (Zj - Mj + Mi) = 2- A/2 + Mi ^ so M 
is not surely optimal for Y* at i = \. 



4 Characterization of surely optimal martingales 

In this section we give a characterization of martingales that arc surely optimal 
for alH = 0, . . . , T. 

Theorem 6 A martingale M with Mq = is surely optimal for i — 0, . . . ,T, if 
and only if there exists a sequence of adapted random variables (Ci)o<i<T, such 
that Ei^iQ — 1, and Ci > for all < i < T, and 

i 

M, = M* ~A*+Y. ^ ^'-i) (10) 
1=1 
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where, respectively, M* is the Doob martingale and A* the predictable process 
of the Snell envelope Y* as given in (0). 

Proof, i) Let us assume that M is surely optimal as stated. Then by ^ it 
holds for any < i < T, 

Y*, = max iZ, - M,- + Afi_i) 

= max(Zj_i, Afj_i - Mi + max (Z, - M,- + Mi)) 

i<j<T ■' 

^ max(Z,_i, Af,_i - M, + Y*). (11) 

Since Zi-i < Y*_i, and since Zi^i < Yi*_i implies A*_i — A*, we obtain from 
(fTT|) and the Doob decomposition Y* = Yg* + M* - A* 

Y*_^ - = (Ai,_i -M, + Y*- 

= (A//,_i - M, + Af* - m:_, - A* + AU + YU - Z^-l)^ 

= u.-.<Y-_, - M, + Ml - A/;_i + y;_i - z,_i)+ 
+ - Af, + A/; - m;_i - 4* + . 

So we must have 

^z,-^<Y^_, {M,-i - M, + M*- Af;_i + - , and 

(A/,_i - M, + A/* - Af*_i - A* + =. 0, 

respectively. Hence we get 

lz,_i<y/_, (Af.-i - Af, + M* - A/*_i) = 0, and (12) 

(A/,_i - Af, + Af* - m:_, - A* + AU) = -lz,^^=Y;_,^^^, (13) 

for some non- negative J^^-measurable random variable /x^. W.l.o.g. we assume 
that /ii = on the set {Zi^i < Y*_^}. By taking Fi-i conditional expectations 
on both sides of ([13]), and using the martingale property of both M and M* , 
and the predictability of A* , it then follows that 

E^-ifi, = lz^_,=Y:_,E,^l^^^ - lz.-i=yr_, (A* - . (14) 

In particular, since fii > almost surely, it follows from that /i; = on the 
set {A* — A*_^} (in which < Y*_^} is contained as a subset). We next 

define 

1 (A* - ^UiV' M., if A* > ^^g^ 
1 1, else, 

and we see that we have a.s. /ii = [A* ~ ^*_i) Q- By we have (using the 
convention • cx) = 0) 

Ei^iCi = lA'^>A'^_^Ei^i {A* - A*_^) ^ ^i + Iai=ai_^ 

= 1a*>A*_j1z,_i=Y/_j + lA*>A*_jlZi_i<K*_j + 1a*=a*_j = 1, 
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since the middle term is trivially zero. We thus obtain from and ([T^ 

M,_i - M, + M*- M*_i - A* + 4*_i - - (4* - A*_^) Q, 

from which PH)) follows. 

ii) Conversely, if a martingale Af satisfies , we have for any < i < T, 

max (Z, - M, + M,) = max^ (^Z, M* + - E (^'* " 
and then by ([7]) the almost sure optimality follows. ■ 

By Theorem [6] we have immediately the following alternative characterization 
of almost sure martingales. It basically says that a martingale is surely optimal 
if the Snell envelope can be representated in a way that resembles the Doob 
decomposition but where the predictable process is replaced by a process which 
is in general only adapted. 

Corollary 7 A martingale M with Mo = is surely optimal for i — 0, . . . ,T, if 
and only if there exists an nan- decreasing adapted process N with Nq = such 
thaE 

Y* =Y* + M,-N,. 

Proof. If M is surely optimal as stated, we have by the "if" part of Theorem [6] 
(see (HOl)), 

i 

Y* - Y* - M, = - ^ (A* - Al,) C, = -iV„ (16) 
1=1 

with N being adapted, non-decreasing and Nq = 0. Conversely, if 

Y* = Y* + Ah - 

for some martingale Af, Mq ~ 0, and non-decreasing adapted iV, A^'o = 0, we 
consider for each < i < T, 

max (Z, - M, + M,) = max (Z, - Y* - N, + Y* + NA < Y„* 

i<j<T i<j<T ^ J J I. I 

and then apply ([7]) again. ■ 

We have the following remark. 

^Note that TV is not assumed to be predictable. 
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Remark 8 Let the martingale M with Mq ^ be surely optimal for i 
0, . . . ,T. For the non- decreasing process N defined by il6\) it holds that 



Y* - Ah + M,_i - Z,_i = YU -N^ + 7V,_i - =: C/„ 

and since by il6\) . Ni — Ni^i ~ [A* — A*_i^ (ij we obtain from (1111) 

iU,)+ = - a.s. 

So, in particular we have that (Ui)^ is J-i-i-measurable while Ui itself is gen- 
erally not, except for the case where M — M*. A similar observation will en- 
countered later on in (|26p . 

From Theorem ini it is clear that there exist infinitely many martingales which 
are surely optimal for all i = 0, . . . , T. In the following example we construct a 
one-parametric family of such martingales which includes the Doob martingale 
of the Snell envelope. 

Example 9 Let us assume Z > a.s. (if Z is strictly bounded from below by a 
constant —K, we may consider the equivalent stopping problem due to Z + K). 
Then Y* > a.s., and for any a, < a < 1, we consider 

Y* TV,* 
C,i :— 1 — a + a = 1 — a + a- 



E^-iY* N, 



-1 



where N* is the martingale part of the multiplicative decomposition Y* ~ YqN*B* 
of the Snell envelope (see Examples Obviously, it holds Ei^iQ — 1 and 
Q > 0, and hence, by Theorem\^ we obtain for every < a < 1 a martingale 

* / N* \ 

M, = m; - A* + E (A* - A-i) - « + «^ j 



N* 

M:-aA*+aY,{A\-AU)^ 



1=1 



N, 



l-l 



which is surely optimal for i = 0, ...,T. Thus, for q = (i.e. Q = I) we retrieve 
the standard Doob martingale of the Snell envelope, and for a — 1 we obtain 

* AT* 

Nh = Y* -Y* + Y. (^'* - ^^-i) — 



1=1 



N, 



l-l 



1=1 
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Note that this martingale differs from the martingale Yq {N* — 1) from Example\^ 
(they would coincide after dropping the factors B*_-^). It is easy to show (using 
Theorem\^ again) that the latter martingale is in general only optimal at i — 0, 
while the martingale JiTp is surely optimal for all i = 0, ...,T, hy construction. 

The next theorem provides a key criterion for identifying surely optimal mar- 
tingales. 

Theorem 10 Let Y* he the Snell envelope of the cash- flow Z and let M he any 
martingale. Then, for any i g {0, ...,r} it holds 

max (Z, - M, + M,) e =^ max (Z, - M, + M,) = Y* . 

i<j<T ■' i<j<T 

Proof. Let us suppose di := maxi<j<T [Zj — Mj + Mi) e Fi and define the 
stopping time 

n = inf {j >i: Zj - Mj + M, > J . 
By the definition of i?i we have i < Ti < T almost surely. We thus have 

Y* > E, Zr, > E, {Mr, - Af, + z90 = z9„ 

by Doob's optional sampling theorem and the fact that di € Ti. On the other 
hand we have -Qi = Et'&i> Y* due to 0. ■ 

Remark 11 While in this paper we work in a discrete time setting, it is ohvious 
that Theorem] 1 (A can he proved in (almost) literally the same way for continuous 
time exercise as well. 

5 Stability of surely optimal martingales 

In equivalent terms, Theorem [10] states that, if a martingale M is such that the 
conditional variance of 

^^{M) := max {Zj - M, + M,) 

i<j<T 

is zero for some < i < T, i.e. 

Var,79,(M) := E, (i9,(M) - E,MM)f - 0, a.s., 

then ?9i(M) = F/. Hence the martingale M is surely optimal at i. In this sec- 
tion we present a stability result for martingales M which are, loosely speaking, 
close to be surely optimal at some i, in the sense that Var^ ■di{M) is small. More 
specifically, we provide mild conditions on a sequence of martingales (M*-"^)„>i 
which guarantee that the corresponding upper bounds converge to the Snell 
envelope in a sense, although the sequence of martingales (Af'^")) does not nec- 
essarily converge. We have the following result. 
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Theorem 12 Let i e {0,...,T}. IfVaXi'dY^ 4 /orn ^ oo, where i?,^"^ := 
t?i(M(")), anrf i/m addition the sequence of martingales (M-""^ ) is uniformly 
integrable, then it holds 

Proof. Fix an i e {0, T} and suppose that the assumptions of the theorem 
are satisfied. Now take an e > 0. By introducing an auxiliary time d > T and 
setting Zg = we next define the stopping time 



inf {,] >i:Zj- Mj"^ + M^"^ > Ei^""^ - e} A 9. 



We thus have with M^"^ := M^"\ n > 1, 



= e4"^ -e-E, (m(") - Mf") + - e) l^^(„,^,j a.s., 



hence 



eM"'^ <Y* +e + E, 



1 (") a 
Ti =9 



=: y; + e + C/f h^w^^ a.s. (18) 
Now it is easy to see that the family of random variables ( f/^"' ) is uniformly 

V / n>l 

integrable too. We so may take K^> Q such that 

supi?;7i">l^<„,^^^ <e. 

n>0 * 

Further observe that due to a conditional version of Chebyshev's inequaHty, 



Since the family yEi J is bounded by 1, it is uniformly integrable. 

Hence, it follows that 

^»l{.(")=a} ^0- (19) 

We thus have 



< e + irei^ly(„)<^^l^w^9 <e + K,EE,l^(„)^g < 2e 
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for n> N^^K, by HI]). So for n > N^^k,, derive from 

^ < £:y, + e + C/i"^l^(-)^a < ^^i",* + 3e. 



Thus, 

I 

Since e > was arbitrary, 



E-dl"^ < EY* + 3e 



lini„^oo < EY* 



On the other hand, due to ([7]) we have Ei'd'f'^ > Y* a.s. for all n, so 



which finally proves Eid^"^ ^ Y^ 



(^Ed["^ - EY*) < 0, 



Remark 13 Like Theorem llCH Theorem \12\ can be formulated in a continuous 
time setting as well with (almost) literally the same proof. 

The following simple example illustrates that Theorem [1^ would not be true 
when the uniform integrability condition is dropped. 

Example 14 Take T = 1, Zq = Zi = 0, Af^"^ = 0, Af}"^ =: with £'oC„ = 
0, n — 1,2, . . . Then obviously Yq = 0, and we have 

^[."^ - max(Zo - Af^"\ Zi - Af}")) = max(0,^(")) = 

Now take 



1 with Prob. 

1 — n with Prob. — 



(hence Eo^^"'> ^ 0). Then, for n ^ oo we have Varo i?!,"^ = Eo{£}+^f - 



Clearly, for each K > 1, Eq 



oo, hence the ^Af-|"''^ are not uniformly integrable. 

In view of the next Corollary, Theorem [12] may be considered as a stability 
theorem related to Theorem [TOl 

Corollary 15 Let A4^^ be a set of uniformly integrable martingales. Then for 
any i £ {0, . . . , T} it holds: For every e > there exist a d > such that 

[M eM"' and EYm d,{M) < 5] =^ < E^i(M) - Y* < e. 
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Proof. Suppose the statement is not true for some i. Then there exists an 
Co > such that for all n S N there exists a martingale M^") £ M.^^ , for 
which £;Varii?i(M("') < 1/n and E^i{M'-''^) - Y* > cq. Since convergence in 
Li implies convergence in probability along a subsequence (indexed again by 

n) we thus have Var, 7?j(M(")) 0, and E |z9i(M(")) - X,* | > eq along this 
subsequence. This contradicts Theorem [T^ ■ 

Remark 16 Theorem \12\ is important in practical situations, for instance, for 
(possibly high dimensional) underlyings of jump- diffusion type in a Levy-Ito 
setup. In this environment we may consider the following class of uniformly 
integrahle martingales. 

Let W be an m-dimensional Brownian motion and let N denote a Poisson 
random measure, independent ofW, with (deterministic) compensator measure 
u{s, du)ds such that 



t 

2 



{u^ A l)v{s, du)ds < oo, < i < T. 

Let {Tt)o<t<T be the filtration generated by W and N , augmented by null sets. 
Now let X be a D -dimensional Markov process, adapted to (J-'t), and consider 
the mappings c : [0, T] X ^ R>o and d : [0, T] x x ^ R>o satisfying 



e( \c{s, X ds <oo, sf ( \d{s,Xs,u)\'^v{s,du)ds < 

Jo Jo Jmi 



oo. (20) 



We define the class of uniformly integrable martingales, M.^^ , as the set of all 
martingales M satisfying 

Mt = Mo + + Mf 

= Mo+ / <f''{s,Xs)dWs+ f f f'^{s,Xs,u)N{ds,du), 
Jo Jo Jw 

where cp'^ and ip'^ satisfy 

and N = N — V is the compensated Poisson measure. Note that M is indeed a 
martingale and that the expected quadratic variation of M is given by 

E[M,M]^=E [ \ip''{s,X,)\^ds + E [ [ \(p'^{s,Xs,u)\'^v{s,du)ds 
Jo Jo Jm 

<e[ \c{s,Xs)fds + E [ [ \d{s,Xs,u)\^iy{s,du)ds. 
Jo Jo Jmi 

We then have for every t Cz [0, T] , 

sup E\Mt\^ < sup E sup |Mtp < sup CE[M, M] <oo, 

MeM'-" MeM"' 0<t<T MeMVi 
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where the second estimation results from the Burkholder- Davis- Gundy inequality 
and the third estimation follows from (j20p . Finally, an application of the de 
la Vallee Poussin criterion yields that M.^^ is indeed a family of uniformly 
integrahle martingales. 

6 New dual algorithms for pricing of 
Bermudan derivatives 

In this section we consider the design of a new dual algorithm for solving multiple 
stopping problems, hence pricing Bermudan products, which are based on the 
theoretical insights from Theorem [TOl Theorem [121 and Corollary [151 In the 
following, we provide an assessment of the merits of variance minimizing dual 
algorithms based on these results. 

6.1 Merits of variance minimizing dual algorithms 

Let Q be some index set and = {M'' : q E Q} be a set of uniformly integrable 
martingales such that M contains a martingale which is surely optimal at 
i = 0. Suppose that for any q £ Q we have N samples of 'do{M'^-'^"-^), n — I, N. 
Based on these samples we may estimate Y&io'i^oiM'^) ~ Var-do{M'^) as usual 

by 

Var(^) MM') := ^ (i?o(Af«'("') - MM'^)n) , with 

ri=l 

1 N 

,9o(Af«)jv:=-^^?o(M«'(")). (21) 

n=l 

So, in principle, only two realizations {N — 2) would be enough to identify a q* 
such that 

2 2 

= Vari?o(M'?*) = Var'^^ MM"') = minV (mM"^'-''^) - MM'^)n) , 

qeQ V / 

n— 1 

and then obtain Yq — ^^{M'' ) = 'do{M'^ ''-'^'). Due to this stylized argumenta- 
tion we may expect that in a case where although the set Ai doesn't contain a 
martingale that is surely optimal at i = but at least one martingale Af? such 
that Var??o(Af') is "small enough", we only need a relatively small sample size 
N to identify this martingale, leading to a tight upper bound Y^^ := Edo{M'^). 
In the following, we formalize this idea by giving precise estimates for the vari- 
ance estimators in terms of the family of uniformly integrable martingales. 

Suppose we want to obtain an upper bound which is bounded from above 
by Yq* -|- e for some given e > 0. Consider a family of uniformly integrable 
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martingales Ai = {M' : q G Q} which is "rich enough" in the sense that there 
exists a (5 > according to CoroUary [151 such that 

{M eM: Vari?o(M) < k5} ^ 0, 

for some < k < 1. Then, in particular, we have inf^gg Var'i9o(Af') < k,8 which 
implies that for any q & Q ^nb have iJVar i?o(M9) = Varz?o(-W^) < <5- Now 
Corollary [15] again yields < i?i?o(-W) — ^o* < ^- I^^r a given set of realizations 
'i9o(-W'^"''), n — ^T--,N, q £ Q, we now may try to find an optimal q% £ Q 
arising as the solution of the minimization problem 

inf Var(^) ??o(M«) = Var^^^ MM"")- (22) 

q&Q 

For convenience, we assume that such a does exist. Furthermore, we assume 
the existence oi q° E Q which satisfies 

inf YarMM'') = Var i?o ) ■ 

Let < a <C 1 be a small threshold probability. Then one can show under mild 
conditions on the family of random variables {i?o(Af^) : q S Q}, that for some 
constant C > and quantile coefficient Cq > (only depending on a) we have 
with probability larger than 1 — a, 

VarWi?o(A^«) < Var,9o(M') (^1 + Ca^^^ , (23) 

|^l-c„y^^ Varz?o(M«) < Var^ t?o(M''), for 
(see Appendix for details). Thus, with probability larger than 1 — a, 

^1 - c„ Var i9o (Af* ) < Var^ i9o (M'" ) < Var^ t9o (A^"" ) 

< Var79o(AK) ^1 + c 

which yields 

1 + c .[^ 

Var79o(M««) < k5 

1-c 



This implies that for 



TV = inf <( n : k < 1 } (24) 

1 ~ Cqi/ — 
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we have P [Var?9o(-^'^") < > 1 - a. Thus, by Corollary [13 the dual upper 
bound due to the martingale M'^^ identified by (1^^ falls below Yq + e with 
probability larger than 1 — a. As a main feature, equation (j24p demonstrates 
that the smaller k, the fewer samples N we may choose for the identification of 

The above argumentation suggests to minimize the estimated variance of 
the dual estimator over a parametric set of martingales using a relatively small 
sample size N. However, as the parametric set of martingales M needs to be 
"rich enough" , in practice there may be many parameters involved, which in 
turn may lead to a non-convex minimization problem with many local minima. 
As a remedy to this problem, rather than directly minimizing the variance of the 
dual estimator at time zero, we propose to minimize backwardly the expected 
conditional variances EYsLTi'di{M'^) over q d Q, starting from i = T (where 
the conditional variance is trivially zero) down to i = 0, using a simple but 
effective recursive relationship between ■&i{M'') and di+i{M'^) as explained in 
the next subsection. For this backward minimization procedure the arguments 
above apply as well and moreover, as we will see, it opens the possibility for 
linear regression, hence also the possibility for fast numerical implementations. 

6.2 Backward dual variance minimization 

Motivated by Section [6.1l we now develop a backward recursive simulation based 
algorithm for the construction of a dual martingale M that yields tight upper 
bounds. In view of a such a Monte Carlo approach, we assume a Markov 
setting generated by some underlying Markov process X :— {Xt)Q^^^j, , and a 
cash-flow of the form Zj := Zj{Xj) := Z{j, Xj). First we describe the algorithm 
in a pseudo language which involves terms such as conditional expectations 
and conditional variances. Then, we spell out an implementable Monte Carlo 
algorithm where these expressions are replaced by their empirical counterparts. 

To start out on a pseudo algorithmic level we construct a martingale M 
backwardly in a recursive way by establishing that from i = T down to i = 
the expected conditional variances i? Var (M) are "as small as possible" in 
a sense that we will describe. The martingale M is such that for j > i, any 
increment 

Mj — Mi is measurable with respect to AJ^^j- :— a{Xs ■ i < s < j}. (25) 

It is easy to see that the Doob martingale of the Snell envelope meets this 
measurability property, however, in general Theorem |6] yields that there may 
exist many other surely optimal martingales satisfying this property. 

A corner stone of the whole procedure is the following recursion that holds 
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for any martingale M and any i < T, 

i9i(M) = max I Zi, max (Z.-AL+Mi)] 

\ i+l<j<T ■' J 

= max(Z„z9i+i(Af) +M, - M^+i) 

= Z, + (M) + M, - M,+i -Zi)+ . (26) 

Obviously, at every i = 0, {)i{M) only depends on (M^ — Mi).^.^j, , and 
at the starting time i = T we initially have driM) = Zt which trivially satisfies 
EYatt i'driM)) = 0. Note that if M were already surely optimal, i.e. i?(M) 
were already equal to Y* , then Remark [5] would imply that for Ui := ■di+i{M) + 
M, - M,+i - Z,, {Ui)+ = d^{M) - Z.i = Y* - Z^ is already J^-measurable. 

Now the essential idea is comprised in the following backward induction: As- 
sume that for i+1 < T we have constructed the increments (M, — Mij^i)^^^^^^^^, 
and i?i+i(M). Now the task is to find a random variable 6+i such that 

6+1 is AJ"i^i+i-measurable, i?i6+i = 0, (27) 

that solves the following minimization problem 

6+1 := argmin Var,7?,(M(e)) 

argmin S Var, (i^,+i(Af) - ^ - ^0+ ■ (28) 

Intuitively, ^i+i represents the optimal martingale increment and thus, we put 
-^j(6+i) - Afi(^i+i) := M, - Mi+i + ii+x for j > i + 1. By construction, 
the random variable 6+i satisfies ([ST)) , therefore, we obtain a set of martingale 
increments (Mj (^i+i) — Mi(^i+i))^^^-^j,, which has now been extended from 
J = i + 1 to = i and which satisfies for j > i + 1, 

Afj(6+i) - M,+i(e»+i) = Af,(e«+i) - A'/«(6+i) + M,(C^+i) - M,+i(e^+i) 



and by construction also the measurability requirement (1251) . Now we extend 
the increments (Mj — Afi+i)^^j^^^^j, from j = i + 1 to j = z by setting 

[M, - A./,),<,<T = (M,(6+i) - M^(6+i)),<,<T 

Finally we put 

d,(M) - + (7?.+i(Af) - C^+i - Z.)^ • 

After carrying out these steps backwardly from i — T down to i = we end up 
with a family of martingale increments (Mj — Mq)^^^^^, , hence a martingale 
(-^j)o<j"<T' — without loss of generality. This martingale will be 

subsequently used to compute a dual upper bound for Yq via 

Y^P = E max (Z,- - M,). 

0<j<T ^ ' 
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The key step in the above procedure is to find a solution to minimization 
problem pS]): Suppose that the martingale increments satisfying ([27)) for some 
fixed i may be parametrized as ^i+i(/3) where /? is some generic parameter. 
Based on a set of simulated trajectories of X one may then estimate for some 
/3 (which we specify in more details below) the conditional variance 

£; Var, (^?,+i(M) - e.+i(/3) " Z,)+ £; Var,f/+(/3) 

by using e.g. kernel estimators (e.g. see iLierol ( 1989f) ). and next minimize 
with respect to /?. In particular when the dimension of the parameter space is 
very small (typically one-dimensional) this may lead to a feasible Monte Carlo 
procedure. However, if the set of martingale increments ^i+i(/3) is "rich enough" 
and is moreover linearly structured in /?, that is 



K 

6+i(/3)=5]/3fcmi+\, 



fc=i 

where /3 — (/3i, . . . , I3k) G and the random variables tn|^\, k — 1, . . . , if, 
satisfy ^7} for if > 1 sufficiently large, it is in general more effective to solve 
the dominating problem 

arg mini; Var, [/,(/?) := argmini;Var, {l}^+l{M) - 6+i(^) - Z,) 

= arg mini; Var, ( ^^+l{M) ~ V /3fem|^\ ) . (29) 

V fc=l / 

The reason is twofold. On the one hand, if we succeed to find /3° G M^' such 
that ENd,XiUi{f5°) is sufficiently small (if it were zero, we would have arrived at 
an surely optimal martingale increment), then since 

arg mini; Varxr.t/+(/3) < EYtir,U+ {(3°) < EVaT,U^{(3°), 

£'Var,f/+(/3°) is generally even closer to zero and so /3° can be considered a good 
approximation to (|28p as well. On the other hand, most importantly, problem 
can be treated as a linear regression problem. 



[/3°,7°]= argmin E 



K Ki 



(30) 



fc=l k=l 

which employs an additional set of basis functions ^/^^(t, x), fc = 1, Ki. To see 
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this, note that (15(71) is equivalent with 



K 



[13° , 7°] = arg min EE, (Af ) - ^ /3feml+\ - E,d,+i (M) 

+£;,i9,+i(Af)-^7fe^fc(i,X0 
fc=i / 

= arg min | E Var, ( (Af) - V /3fcm|+\ ) 

+E (^E,d,+i (AT) - 7fci^fe (z, X,)^ 
hence f3° satisfies (|29)) . and moreover for 7° it holds 

7° =argminS fs,i?,+i(Af)- V7feVfe(*,X,)^ • (31) 
Further, the regression procedure pop delivers as by-product 

A'l 

k=i 

an approximate continuation function that may be used afterwards to define a 
stopping rule and to simulate a corresponding lower biased estimation of Yg* . 

Remark 17 (i) In virtually all practical applications we are in a setting as 
described in Remark \16\ In this environment we may model ^i+i as linear 
combinations of the form 

fc=i -^^i 

+ T.f^k ^i{s,X,,u)dN{ds,du), (32) 



fc=i 



Ti 



where Ni + N2 = K and f%{s, x) and (pf.{s, x, u) are suitable sets of basis func- 
tions satisfying the conditions in Remark\TS[ In this setting, we have 

m\% = I ^ ipl{s,Xs)dWs+ I ^ iptis,Xs,u)dN{ds,du) 

a«d/3=(/3J,...,/3^^,/3f,...,/3]*,JeM^. 
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As an alternative, we may also take 

(33) 

fc=i 

for an arbitrary given set of discounted tradables (b\''^) where the i?^'^'' = 

\ /a<j<T 

B^^\j,Xj) are provided by some specific problem under consideration. For ex- 
ample it may happen that discounted European options are available in closed 
form. In any case, iSS^) and I133\) satisfy the requirements \21^ for any vector 
parameter /3 G M^. 

(ii) Suppose that the system of basis martingale increments and basis functions 
in the regression based minimization i3(^) is sufficiently "rich" that there even 
exist f3°° and 7°° such that 

K Ki 

d.+,{M) - ^/3rm!+\ -Y,il°MhX,) = a.s. 

k=l k=l 

then one would need only one trajectory for X to identify /3°° and 7°° via LSU\) . 
This is a similar situation as discussed in Section \6.1l ' In practice when the 
system LSS^) is rich enough, a relatively low sample size will be sufficient to 
solve i30\) effectively. This phenomenon will be confirmed by our experiments 
in Section^ 

Description of the Monte Carlo algorithm 

Let us now spell out the empirical, implement able counterpart of the procedure 
described above. Based on a set of trajectories fx.-"'' ) , n ~ 1, ...,N, we 

V ■' /j=0,...,T 

carry out the following procedure. 



Step 1: At i ^T we set on each trajectory i?^"' := i?^"'(M) ZriXf') 
and ( ) - M^")) = M*") - M*") = for n = 1, N. 

\ J ^ J T<j<T 

Step 2: For n = 1, let (m'^^ -M/"]) be constructed. For 

V ■' / i+l<j<T 

i = T — 1 down to i = 0, based on the N samples, we solve the regression 
problem 

AT / K Ki \ 

^«,7« := argmin 1 ^ h?^:;) (Af ) - ^ /3,mi^f - ^ 7,^,(^, xf")) 
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We then put 

K 



and 



/c=l 



iS'fep 3: We simulate iV new independent samples fX -"-* ) 

V -J / .,=o 

1, A'^, which give rise to the new martingale samples 



j=0,...,T 



, n 



K 



i=i fe=i 

Then, an upper biased estimate for the upper bound is given by 

^o' ■= ^ Eo?.-<-, ^t'i^'"") - EE^^'^?'"^ • (34) 

n=l \ j = l fe=l / 



4- Based on the stopping rule 

ro(XO inf{* > : > ^7«i^fe(«,^.)} (35) 

k=l 

we put 



^0 -T;2^^,„(xw)(^. 



ri=l 



which yields lower biased estimate to Yr 







At this point, l e t us b riefly compare our algor ithm with the algor ithm from 
Belomestnv et. alj (|2009l) . The methodology of iBelomestnv et. alj ((2000) to 



compute dual martingales is built upon a procedure to numerically approximate 
Clark-Ocone derivatives of an approximati ve Snell envelope Y with respect to 
a Wiener filtration. The key ingredient in IBelomestnv et. ahl (|2009t ) is to ap- 
proximate this Clark-Ocone derivative on a (fine) grid tt = {to, • ■ • i^at} which 
contains the exercise grid {0, 1, . . . , T} using the estimator, 

:=^E,^.[A-M^, y.+i], (36) 
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where AJ = tj+i ~tj and ^'^Wj = Wt-^, - Wt^ . Due to ([55)) . lBeloinestnv et. al 
( 2009( ) morally requires to carry out a regression at each tj € tt on the fine grid 
77. However, our algorithm only needs to carry out regressions on the coarser 
grid of the possible exercise dates {0, . . . , T} C n. Moreover, note that 
requires as an input some approximation of Y, which needs to be obtai ned by 
another method, such as the method of iLongstaff and Schwart3 ( 2001 ). Fur- 
thermore, if the grid tt happens to be very fine, i.e. if |7r| = sup^ — tj\ = e 
for some very small £ > 0, the complexity increases, and also the right-hand 
of ([55]) becomes very large and even explode s as e approaches zero. To cir- 
cumvent these instabilities, iBelomestnv et. al. 1 (l2009l) implement regressions on 
the coarser exercise time grid and then locally interpolate on the finer grid tt. 
In our algorithm, these problems do not appear at all. Finally, we underline 
that obtaining numerically the Clark-Ocone derivative (|36p in a non- Wiener fil- 
tration (e.g. filtratons generated by Levy processes) is not so straightforward. 
In contrast, in our framework, the regression procedure (|30p may include jump 
martingales as depicted in Remark [T51 



7 Numerical examples 

In this section we present the numerical results of the backward algorithm de- 
scribed in Section [S] The performance and accuracy of our algorithm is il- 
lustrated by testing it with two benchmark examples from the literature, a 
Ber mudan basket - put on 5 as sets and Bermudan max- call on 2 and 5 assets 
(see Bender et al. ( 2006al) and IBelomestnv et. al. ( 2009() respectively). In both 
examples, the risk-neutral dynamic of each asset is governed by 

dXf = (r - S)Xfdt + aXfdWf , d=l,...,D, 

where D G N is the number of assets, Wf, d — 1, ...,0, are independent one- 
dimensional Brownian motions, and r, 6 and a are constant real valued param- 
eters. Exercise opportunities are equally spaced at times Tj = ^ , j = 0, J. 
The discounted payoff from exercise at time t is given by 

Zt{Xt) = e-^^K - ^L±_^_t^)+ for the Bermudan basket-put, 

and 

Zt{Xt) — e~''*(max(Xj-^, . . . , Xjr') — K)^ for the Bermudan max-call, 

where we denote Xt = {X^ , . . . , Xj^). For both products, the time interval 
[Tj,Tj+i], j = 0, . . . , J — 1, is partitioned into L equally spaced subintervals of 
width Ai = ^ with N = J x L. 

The implementation can be outlined as follows. We first simulate M inde- 
pendent samples of Brownian increments 

AW^^{Awl'^'^\...,Awl'^^"'''), i = l,...,N, m = l,...,M. 
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Then the trajectories of xf"^ = (X^^'^'"^ . . . , '^"^), i = 1,...,N, m 
1, . . . , M, are given by 



ioi d = 1, . . . ,D and initial data Xq = {Xq , . . . , X^ ) . 

We now carry out the backward Monte Carlo regression algorithm as de- 
scribed in Section [S) In this Wiener setting, we recall Remark [T7] (i) and 
choose as the spanning family of surely optimal martingales the Wiener in- 
tegrals m|^\ = Jt'^^ (p%{s,Xs)dWs. More precisely, we solve in a first step the 
regression problem backward in time 



M 



(^W,7«) := argmini- <"i -fl^'^ Mu.X^^^)dWi- 
fe=l 



k=l 

2 



i = r-l,...,0, (38) 



for two families of basis functions {^Pk) — ('^fc'') with cp^^"' — ip^^\ and (■(/'fc)j 
chosen as explained below. In psp the Wiener integrals are approximated by 
the standard Euler scheme, using the same Brownian increments as in (I37p . 
Finally, a new independent simulation is launched and we estimate an upper 
bound Y^^ and a lower bound Fq"™ by means of and 

As one may expect, the choice of basis functions is crucial to obtain tight 
upper and lower bounds. In this respect, special information on the pricing 
problem may help us finding suitable basis functions. One way of retrieving 
additional information is to employ martingales representations and Malliavian 
calculus techniques to obtain more specific insights into the structure of the 
pricing dynamics. We illustrate this by considering the following stylized setting: 
By the Markov property of X, we have that Et {Zt{Xt)) — f{t,Xt) for some 
measurable function f{t,x) and < t < T = Tj. Let us assume that f(t,x) 
is differentiable in x. Then, by Ito's formula and the fact that Et^Zx) is a 
martingale we have 

D „T 

Zt{Xt) ~ Et,_, {Zt{Xt)) = E ^ / Xt)XtdWt. 

Recall that -Ot — Zt and Etj_i {Zt{Xt)) can be expressed in the following 
form 

Etj_, {Zt{Xt)) = e-'-^''-'EP{Tj_,,XT,,_,;T), 

where EP{t, x; T) is the price of the corresponding European option with ma- 
turity T at time t. Thus, it is natural to choose from time T to time T/_i 
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European option values for the basis (V'fe(i) x)) and the corresponding European 
deltas multiplied by the value of the underlying asset for the basis {ipk{t,x)). 
Although for the following steps {t < Tj-i) there is no easy way to predict 
optimal choices of {tpk) and (ipk), the above analysis suggests to always include 
the still- alive European options into the basis (ijjk) and include the information 
on the European deltas into the basis (ipk)- In fact, b ased on similar arguments , 
this choice of basis functions were already proposed in Belomestnv et. al. ( 2009[ ). 



7.1 Bermudan basket-put 

In this example, we take the following parameter values, 

r = 0.05, (5 = 0, cr==0.2, D ^ 5, T = 3, 

and 

X^^ ...^X^ ^ xo, K = 100. 

We perform the simulation of the underlying asset X from (|37p with a time step 
size At = 0.01. For Tj < t < Tj+i, j = 0, . . . , J — 1, we choose the set 

{l,PohiXt),Poh{EPit, Xt;T,+i)), Poh{EP{t, X*; Tj))} 

as basis functions (i/'*:), where Poln{y) denotes the set of monomials of degree 
up to n in the components of a vector y and EP{t, X; T) denotes the (approxi- 
mated) value of a European basket-put with maturity T at time t. Recall that 
the family {tpk) serves as the regression basis for the continuation value. Further 
we choose 

/ f^, dEP{t,Xt;T,+,) \ f^, dEPit,Xt;Tj) \ \ 

I'l* dXf dXf 

as a regression basis {(fk) spanning the family of the surely optimal martin- 
gales. Since there is no closed-form formula for the still-alive European basket- 
put, we use the moment-matching method to approximate their values (see e.g. 

Brigo et al.l (12004 . and|Lord| (120061) 1. To this end. Let St ^ - ^ "' ^ 



D 



and consider another asset Gt whose risk-neutral dynamic follows 

dGt = rGtdt + aCtdWl, 

where ct is a constant. The value of the European put on this asset can be easily 
computed by the well-known Black-Scholes formula, that is, 

E[e-''^{K - Gt)+] - BS{Go, r, a, K, T). (39) 

If St and Gt have the same moments up to two, then the Black-Scholes price 
in (|39|) can be regarded as a good approximation for the value of the European 
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basket-put E {e-'''^{K - S't)+) , for details see lLordI (|2006[) . Since 



and 

E{Gt) = Goe'-^, E{Gl) = Gl^-^+'"^ , 
we can simply set 



1 ^ 



and 



The European deltas can be approximated by 

dBSdGo ^r, J ^ 1 , , 

ln(%) + (r + ^)r 

where di = = — and Af denotes the cumulative standard normal 

aVT 

distribution function. These formulas are straightforwardly extended to the 
pricing at times t > 0. 

The numerical results are shown in Table[TJ We use 1000 paths for estimating 
the surely optimal martingale and the continuation function via the regression 
procedure. Another 300000 paths are used to compute the lower bound and 
100000 paths are used to compute the upper bound. Note that we have chosen 
a relatively small number of samples (1000) for estimating the martingale in the 
regression procedure. We do so because on the assumption that the family of 
uniformly integrable martingales is rich enough, the arguments leading to (I24p 
yield that only a small number of samples are required for identifying a good 
approximation to a surely optimal mart i ngale. We compare our results to the 
price intervals obtained in I Bender et al. ( 2006a ) which are displayed in the last 



column of Table [TJ In our C-I--I- implementation, the run-times for computing 
one set of lower and upper bounds are in the range of 15-20 minutes. 



7.2 Bermudan max-call 

We use the same parameter values as in Section mi except S — 0.1 and D ^ 2 
or 5. As in the previous example we use European (call) options in the basis 
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Table 1: Lower and upper bounds for Bermudan basket-put on 5 assets with 
parameters r = 0.05, 6 = 0, a ^ 0.2, K — 100, T — 3 and different J and xq 



J 


Xo 


Low (SE) 


Up (SE) 


BKS Price Interval 


3 


90 
100 
110 


10.000 (0.000) 
2.164 (0.007) 
0.539 (0.004) 


10.000 (0.000) 
2.172 (0.001) 
0.551 (0.001) 


[10.000, 10.004] 
[2.154, 2.164] 
[0.535, 0.540] 


6 


90 
100 
110 


10.000 (0.000) 
2.407 (0.006) 
0.573 (0.003) 


10.000 (0.000) 
2.432 (0.001) 
0.609 (0.001) 


[10.000, 10.000] 
[2.359, 2.412] 
[0.569, 0.580] 


9 


90 
100 
110 


10.000 (0.0000) 
2.475 (0.0063) 
0.5915 (0.0034) 


10.008 (0.0003) 
2.522 (0.0013) 
0.6353 (0.0009) 


[10.000, 10.005] 
[2.385, 2.502] 
[0.577, 0.600] 



(ipk) and the corresponding deltas in the basis {(fik)- T he value of the E uropean 
max-call option is computed by the following formula (I.Tohnsonl (|l987h V 

C,na. = 5] / exp[--z2] l[U -z + aVT\dz 

I'^i 

-Ke-'^ + Ke--^l[{l-M{dL)), (40) 



where 



1=1 



^ ^ d\=dL+aVf. 



Moreover, straightforward computations reveal that the deltas are given by 

/ x' 



/ eM~\^^]t{^f\^-^^ + <^^\dz, (41) 



I'^l 

and that Cmax satisfies the linear homogeneitjH 



C^ax = ^ ^0^^ + K^^. (42) 



1=1 



^0 



The numerical results are shown in Tabled] They are based on 1000 paths 
for the regression procedure, 300000 paths for computing the lower bound and 
100000 paths for computing the upper bound. As before, we have chosen a 
relatively small number of samples (1000) for estimating the martingale in the 



^Compare also with l l JohnsorJ . Il987l . eq. (9)). 
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Table 2: Lower and upper bounds for Bermudan max-call with parameters 
r = 0.05, S = 0.1, a = 0.2, K = 100, T = 3 and different D and xq. 



D 


Xo 


Low (SE) 


Up (SE) 


A&B price interval 


2 


90 
100 
110 


8.0556 (0.0219) 
13.8850 (0.0276) 
21.3671 (0.0319) 


8.15655 (0.0034) 
14.0293 (0.0044) 
21.5319 (0.0048) 


[8.053, 8.082] 
[13.892, 13.934] 
[21.316, 21.359] 


5 


90 
100 
110 


16.5973 (0.0296) 
26.1325 (0.0356) 
36.7348 (0.0403) 


16.7963 (0.0058) 
26.3803 (0.0072) 
37.0856 (0.0082) 


[16.602, 16.655] 
[26.109, 26.292] 
[36.704, 36.832] 



regression procedure. This is again allowed because the arguments leading to 
((24|) and the assumption that the choice of the basis functions indeed equips 
us with a rich enough family of uniformly integrable martingales yield that 
only a small number of samples are required for identifying a good approxima- 
tion to a surely optimal martingale. The integral expressions from (j40|) and 
((1T|) are numerically evaluated using a simple adaptive Gauss-Kronrod proce- 
dure with 31 points. The pr ice intervals in the last column are quoted from 



Andersen and Broadid (2004). In our C-I--I- implementation, for each set of 



lower and upper bounds, we observe run-times that are in the range of 10-25 
minutes, with the longer computation times for the 5-dimensional case. 



Concluding remark 

The numerical results presented in Tables [U [2] due to our new algorithm may be 
considered as very satisfactory given the decreased computation times (which are 
in the order of minutes in a C-I--I- compiled implementation). In this respect it 
should be noted that computing upper bounds (in a rath er generic way) in orde r 
of minutes is a considerable improvement compared to iBender et all (|2006al ). 
whose upper bounds are computed with nested Monte Carlo si mulation requirin g 
higher computation time, and of comparable range to .Belomestnv et. al.l(|2009^ ■ 
Moreover, the algorithm delivers fast and surprisingly good lower bounds while 
the upper bounds ar e about the sam e range as the ones obtained with the 
al gorithm in .Belomestnv et . al. ( 2009f) . Needless to say that, as for the method 



of lBelomestnv et. al.l (|2009l) . the performance of the here presented algorithm 
will highly depend on the choice of the basis functions. An in depth treatment 
of this issue is considered beyond scope however. 



8 Appendix 

We present in this section some well-known facts from theory of empirical pro- 
cesses which are used to establish the relation in Section [01 

Let {^"^ : q e Q} be a family of random variables and let for each q £ Q, 
t?'^^, 1?'^'^ be i.i.d. samples of For each q E Q we consider the unbiased 
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variance estimator 



N „ , N 



VarW^^:=^^(^^--^^)' with := 1 ^ 



n—l n—1 



hence iJVar^^-'t?' = Ya.ri)'^ for q <E Q. From standard statistical theory it is 
well known that with ji'^ := Ed'' 



Var (Var(^) < j^E - ^i^f = 



4 (Vari??)^ (d'i-^i'i^'^ 
E 



N V War 
Now, as a mild condition we assume that, 

/ ,99 _ ,,9 \ 
E ^ < C for all qcQ. 



V War di 

For example, this holds if £^exp [A ji^^l] < qq for some A > and all q ^ Q. We 
so have in particular 



Var 



(^Var(^) i?"^" ) < (var i?'" ) ^ and Var (var^^^ i9«° ) < (var ) ^ 



while, strictly speaking, the randomness of "(9'" in the first inequality is ignored. 
However, by considering a next from i^^w independent sample, we can show that 
this is not really essential (the details would go beyond the scope of the optional 
analysis of Section 16.11 and are therefore omitted) . From standard empirical 
probability theory it now follows that for any (small) < a ^ 1, there is a 
suitable quantile coefficient Cq (particularly not depending on N) such that 

Var^^) i?« < Var z9« ^1 + c„ y'^j , Var i9« < Var^^^ 7?« + c„ ^^Var 1}'' 

> 1 - a, 
for q e {q%,q°} which implies 
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